Programming Animated Statistical Graphics with R

Author

Erica Chan

Published

January 26, 2026

Modified

February 12, 2026

1 Overview

1.1 Learning objectives

  • Learn how to create animated data visualisations using gganimate and plotly r packages.
  • Learn how to reshape data with tidyr package
  • learn how to process, wrangle and transform data with dplyr package.

1.2 Basic concepts of an animation

When creating animations, the plot does not actually move. Instead, many individual plots are built and then stitched together as movie frames, just like an old-school flip book or cartoon. Each frame is a different plot when conveying motion, which is built using some relevant subset of the aggregate data. The subset drives the flow of the animation when stitched back together.

1.3 Terminology

Before we dive into the steps for creating an animated statistical graph, it’s important to understand some of the key concepts and terminology related to this type of visualization.

  1. Frame: In an animated line graph, each frame represents a different point in time or a different category. When the frame changes, the data points on the graph are updated to reflect the new data.
  2. Animation Attributes: The animation attributes are the settings that control how the animation behaves. For example, you can specify the duration of each frame, the easing function used to transition between frames, and whether to start the animation from the current frame or from the beginning.

2 Getting started

2.1 Loading R packages

First, the following code chunk will check, install and load the following R packages:

  • plotly, R library for plotting interactive statistical graphs.
  • gganimate, an ggplot extension for creating animated statistical graphs.
  • gifski converts video frames to GIF animations using pngquant’s fancy features for efficient cross-frame palettes and temporal dithering. It produces animated GIFs that use thousands of colors per frame.
  • gapminder: An excerpt of the data available at Gapminder.org. We just want to use its country_colors scheme.
  • tidyverse, a family of modern R packages specially designed to support data science, analysis and communication task including creating static statistical graphs.
pacman::p_load(readxl, gifski, gapminder,
               plotly, gganimate, tidyverse)
remotes::install_github("wilkelab/ungeviz")
Using GitHub PAT from the git credential store.
Skipping install of 'ungeviz' from a github remote, the SHA1 (74e1651b) has not changed since last install.
  Use `force = TRUE` to force installation

2.2 Importing the data

In this hands-on exercise, the Data worksheet from GlobalPopulation Excel workbook will be used.

The next code chunk will import Data worksheet from GlobalPopulation Excel workbook by using appropriate R package from tidyverse family.

col <- c("Country", "Continent")
globalPop <- read_xls("data/GlobalPopulation.xls",
                      sheet="Data") %>%
  mutate(across(col, as.factor)) %>%
  mutate(Year = as.integer(Year))
Warning: There was 1 warning in `mutate()`.
ℹ In argument: `across(col, as.factor)`.
Caused by warning:
! Using an external vector in selections was deprecated in tidyselect 1.1.0.
ℹ Please use `all_of()` or `any_of()` instead.
  # Was:
  data %>% select(col)

  # Now:
  data %>% select(all_of(col))

See <https://tidyselect.r-lib.org/reference/faq-external-vector.html>.
glimpse(globalPop)
Rows: 6,204
Columns: 6
$ Country    <fct> "Afghanistan", "Afghanistan", "Afghanistan", "Afghanistan",…
$ Year       <int> 1996, 1998, 2000, 2002, 2004, 2006, 2008, 2010, 2012, 2014,…
$ Young      <dbl> 83.6, 84.1, 84.6, 85.1, 84.5, 84.3, 84.1, 83.7, 82.9, 82.1,…
$ Old        <dbl> 4.5, 4.5, 4.5, 4.5, 4.5, 4.6, 4.6, 4.6, 4.6, 4.7, 4.7, 4.7,…
$ Population <dbl> 21559.9, 22912.8, 23898.2, 25268.4, 28513.7, 31057.0, 32738…
$ Continent  <fct> Asia, Asia, Asia, Asia, Asia, Asia, Asia, Asia, Asia, Asia,…
  • read_xls() of readxl package is used to import the Excel worksheet.
  • mutate(across(col, as.factor)) uses dplyr functions to convert data from the columns defined in col into factor type.
    • mutate_at(col, as.factor) could also be used to derive the same output
  • mutate of dplyr package is used to convert data values of Year field into integer.
  • glimpse() provides a transposed overview of the data to check that all the above steps transformed the data as expected

3 Animated Data Visualisation: gganimate methods

gganimate extends the grammar of graphics as implemented by ggplot2 to include the description of animation. It does this by providing a range of new grammar classes that can be added to the plot object in order to customise how it should change with time.

  • transition_*() defines how the data should be spread out and how it relates to itself across time.
  • view_*() defines how the positional scales should change along the animation.
  • shadow_*() defines how data from other points in time should be presented in the given point in time.
  • enter_*()/exit_*() defines how new data should appear and how old data should disappear during the course of the animation.
  • ease_aes() defines how different aesthetics should be eased during transitions.

3.1 Building a static vs animated population bubble plot

ggplot(globalPop, aes(x = Old, y = Young, 
                      size = Population, 
                      colour = Country)) +
  geom_point(alpha = 0.7, 
             show.legend = FALSE) +
  scale_colour_manual(values = country_colors) +
  scale_size(range = c(2, 12)) +
  labs(title = 'Year: {frame_time}', 
       x = '% Aged', 
       y = '% Young') 

The code chunk above uses basic ggplot2 functions to create a static plot

ggplot(globalPop, aes(x = Old, y = Young, 
                      size = Population, 
                      colour = Country)) +
  geom_point(alpha = 0.7, 
             show.legend = FALSE) +
  scale_colour_manual(values = country_colors) +
  scale_size(range = c(2, 12)) +
  labs(title = 'Year: {frame_time}', 
       x = '% Aged', 
       y = '% Young') +
  transition_time(Year) +       
  ease_aes('linear')  

This code chunk uses:

  • transition_time() of gganimate to create transition through distinct states in time (i.e. Year).
  • ease_aes() to control easing of aesthetics. The default is linear. Other methods are: quadratic, cubic, quartic, quintic, sine, circular, exponential, elastic, back, and bounce

4 Animated Data Visualisation: plotly

In Plotly R package, both ggplotly() and plot_ly() support key frame animations through the frame argument/aesthetic. They also support an ids argument/aesthetic to ensure smooth transitions between objects with the same id (which helps facilitate object constancy).

4.1 Building an animated bubble plot: ggplotly() method

In this sub-section, we will create an animated bubble plot by using ggplotly() method.

This bubble plot includes a play/pause button and a slider to control the animation

gg <- ggplot(globalPop, 
       aes(x = Old,  
           y = Young, 
           size = Population, 
           colour = Country)
       ) +
  geom_point(aes(size = Population,
                 frame = Year,
                 ids=Country),
             alpha = 0.7, 
             show.legend = FALSE) +
  scale_colour_manual(values = country_colors) +
  scale_size(range = c(2, 12)) +
  labs(x = '% Aged', 
       y = '% Young') + 
  theme(legend.position='none')

ggplotly(gg)
  • Appropriate ggplot2 functions are used to create a static bubble plot. The output is then saved as an R object called gg.
  • ggplotly() is then used to convert the R graphic object into an animated svg object.
  • theme(legend.position='none' is necessary although show.legend = FALSE was used as otherwise, the legend will still appear on the plot.
  • the warning “Ignoring unknown aesthetics: frame” is because the frame argument is used by ggplotly(), not by ggplot2

4.2 Building an animated bubble plot: plot_ly() method

In this sub-section, we will create an animated bubble plot by using plot_ly() method.

bp <- globalPop %>%
  plot_ly(x = ~Old, 
          y = ~Young, 
          size = ~Population, 
          color = ~Continent,
          sizes = c(2, 100),
          frame = ~Year, 
          text = ~Country, 
          hoverinfo = "text",
          type = 'scatter',
          mode = 'markers'
          ) %>%
  layout(showlegend = FALSE)
bp